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Abstract. This paper is concerned with nonlinear filtering of the coefficients 
in asset price models with stochastic volatility. More specifically, we assume 
that the asset price process S = (St)t>0 i s given by 

dS t =r{e t )Stdt + v(e t )StdBt, 

where B = (Bt)t>o i s a Brownian motion, v is a positive function, and 
8 = (0t)t>o is a cadlag strong Markov process. The random process is 
unobservable. We assume also that the asset price St is observed only at 
random times < n < T2 < . . . . This is an appropriate assumption when 
modelling high frequency financial data (e.g., tick- by-tick stock prices). 

In the above setting the problem of estimation of 8 can be approached 
as a special nonlinear filtering problem with measurements generated by a 
multivariate point process (tj. , log S Tk ). While quite natural, this problem does 
not fit into the standard diffusion or simple point process filtering frameworks 
and requires more technical tools. We derive a closed form optimal recursive 
Bayesian filter for 8t , based on the observations of (r^., log S Tk )k>i ■ It turns 
out that the filter is given by a recursive system that involves only deterministic 
Kolmogorov-type equations, which should make the numerical implementation 
relatively easy. 

1. Introduction 

In the classical Black-Scholcs model for financial markets, the stock price S t is 
modelled as a Geometric Brownian motion, that is, with diffusion coefficient equal 
to aSt, where "volatility" a is assumed to be constant. The volatility parameter 
is the most important one when it comes to option pricing; consequently, many 
researchers have generalized the constant volatility model to so-called stochastic 
volatility models, where at is itself random and time dependent. There are two 
basic classes of models: complete and incomplete. In complete models, the volatil- 
ity is assumed to be a functional of the stock price; in incomplete models, it is 
driven by some other source of noise that is possibly correlated with the original 
Brownian motion. In this paper we study a particular incomplete model in which 
the volatility process is independent of the driving Brownian motion process. This 
has the economic interpretation of the volatility being influenced by market, polit- 
ical, financial, and other factors that are independent of the "systematic risk" (the 
Brownian motion process) associated with the particular stock price under study. 
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It is also close in spirit to the way traders think about volatility - as a parameter 
that changes with time and whose future value in a given period of interest has 
to be estimated/predicted. They need an estimate of the volatility to decide how 
they will trade in financial markets, especially derivatives markets. In fact, the 
notion of volatility is so important to traders that they even quote option prices in 
volatility units rather than in dollars (or some other currency). Investment banks 
also depend on modelling future volatility in order to price custom-made financial 
products, whose payoff depends on the future path of the underlying stock price. 
Recently, new contracts have been developed that directly trade the volatility itself 
(volatility swaps, for example). We plan to address the issue of pricing options 
within the framework of our model in future research. 

Estimating volatility from observed stock prices is not a trivial task in either 
complete or incomplete models, in part because the prices are observed at discrete, 
possibly random time points. Since volatility itself is not observed, it is natural to 
apply filtering methods to estimate the volatility process from historical stock price 
observations. Nevertheless, this has only recently been investigated in continuous- 
time models, in particular by Frey and Runggaldicr [5]. See Runggaldier [22] for an 
up-to-date survey. See also Elliott et al [2] for a discrete-time approach with equally 
spaced observations, Gallant and Tauchen [6] for an approximating algorithm in 
continuous time, Malliavin and Mancino [17] for a nonparametric approach, as well 
as Fouque et al. [3], Rogers and Zaire [19], and Kallianpur and Xiang [10] for 
still other approaches. There is also a rich econometrics, time-series literature on 
ARCH-GARCH models of stochastic volatility, that presents an alternative way to 
model and estimate volatility; see Gourieroux [7] for a survey. 

Our paper was prompted by Frey and Runggaldicr [5]. Like that paper, we 
assume that the asset price process S — (St)t>o is given by 

dS t =r(8 t )Stdt + v(0 t )S t dB t , 

where B = (B t )t>o is a Brownian motion, v is a positive function, and 9 = (0t)t>o 
is a cadlag strong Markov process. The "volatility" process 9 is unobservable, 
while the asset price S t is observed only at random times < t\ < t 2 < . . . This 
assumption is designed to reflect the discrete nature of high frequency financial data 
such as tick-by-tick stock prices. The random time moments Tk can be interpreted 
as instances at which a large trade occurs or at which a market maker updates his 
quotes in reaction to new information (see Frey [4] ). Hence, it is natural to assume 
that (rfc)fc>i might also be correlated with 9. 

In the above setting the problem of volatility estimation can be regarded as a 
special nonlinear filtering problem. 

Frey and Runggaldier [5] derive a Kallianpur-Striebel type formula (see e.g. [9]) 
for the optimal mean-square filter for t based on the observations of S Tl ,S T2 , ... 
for all Tk < t and investigate Markov Chain approximations for this formula. We 
extend this result in that we derive the exact filtering equations for 9 t that allow us 
to compute the conditional distribution of 9t given S Tl At, S T2 Aty ■ ■ ■ Moreover, our 
framework includes general random times of observations, not just doubly stochastic 
Poisson processes. 

We remark that while being natural, the Frey and Runggaldicr model adopted 
in this paper does not quite fit into the standard diffusion or simple point process 
filtering frameworks (cf. [15], [12], [20]) and requires more technical tools. In par- 
ticular, the general filtering theory for diffusion processes requires that the diffusion 
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coefficient of the observation process does not depend on the state process, while 
in our case the presence of 8t in the diffusion coefficient is crucial. The standard" 
filtering theory for point processes is also not applicable in the present setting since 
the observation process fa, S Ti )i>x is a multivariate process (see also Remark 2). 

It turns out that the resulting filtering equations are simpler than their coun- 
terparts in the case of continuous observations. In the latter case, the nonlinear 
filters are described by infinite dimensional stochastic differential equations. For 
example, if 9t is a diffusion process, the filtering equations (e.g., Kushner filter or 
Zakai filter) are given by stochastic partial differential equations (see, e.g., [20]). In 
contrast, in our setting, the filtering equation can be reduced to a recursive system 
of linked deterministic equations of Kolmogorov's type. Therefore, the numerical 
implementation of the filter is much simpler (see the follow up paper [1]). 

We describe the model in Section 2, state the main results and examples in 
Section 3, provide the proofs in Section 4, and present more detailed examples in 
Section 5. 



2.1. Risky asset and observation times. Let us fix a probability space (f2, T ', P) 
equipped with a filtration F = (Tt)t>o that satisfies the usual conditions (see, e.g. 
[16]). All random processes considered in the paper are assumed to be defined on 
(fi, T, P) and adapted to F. 

It is assumed that there is a risky asset with the price process S = (St)t>o given 
by the Ito equation 



where B = (B t )t>a is a standard Brownian motion and 9 = (0 t )t>o is a cadlag 
Markov jump-diffusion process in R with the generator C. To simplify the discus- 
sion, it is assumed that r(x) and v(x) are measurable bounded functions on R, the 
initial condition Sq is constant, and v(x) and So are positive. 

The process (&t)t>o is referred to as the volatility process. It is unobservable, and 
the only observable quantities are the values of the log-price process X t = \ogS t 
taken at stopping times {rk)k>o, so that To = 0,t^ < t^+i if < oo, and Tfc | oo 
as k T co. 

In accordance with (2.1), the log-price process is given by 



For notational convenience, set Xk := X Tk . Thus, the observations are given by the 
sequence (T kl X k ) k > . 

Remark 1. (Note on the reading sequence.) The reader interested primarily 
in applying our results to real data can focus her attention on Example 3.1, which 
appears to be the most practical model to work with. That example provides 
self-contained formulas for estimating the conditional (filtering) distribution of the 
volatility process. We report on the numerical results related to this example in 
the follow-up paper [1]. 



2. Mathematical model 



dS t =r{8 t )S t dt + v(9 t )S t dB t , 



(2.1) 
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Clearly, the observation process (rfc, Xk)k>o is a multivariate (marked) point 
process (see, e.g. [8], [13]) with the counting measure 

H(dt,dy) =J2 I {r k <oo}S{ Tk ,x k }(t,y)dtdy, 

k>l 

where <5{ Tfc .x fc } is the Dirac delta- function on R+ x R. 

We introduce two filtrations related to (rfc, Xk)k>o- (G(n)) n >o an d (Qt)t>a, where 

- Q{n) := cr{(r fc , X k )k<n}, 

- Q t := a(n([0,r] x r) : r < s,T G S(R)), where S(R) is the Borcl cr-algcbra on 

E. 

It is a standard fact (see III. 3. 31 in [8]) that 

g Tk =g(k), fc = o,i... 

and {Tfc} is a system of stopping times with respect to (Gt)t>a- 

Remark 2. Although Q Tk contains all the relevant information carried by the 
observations obtained up to time the filtration (Qt) t>0 provides additional infor- 
mation between the observation times. To elucidate this point on a more intuitive 
level, we note that the length of the time elapsed between and T k +i carries ad- 
ditional information about the state of 9t after Specifically, if the frequency of 
observations is proportional to the stock's volatility v(0t), t £ jTfe, Tfe+i]] , the larger 
values of i — rfe might indicate lower values of v(0t). 

2.2. Volatility process. A more precise description of the volatility process is in 
order now. Let (R,f?(R)) and (R+ x R,B(R + ) ® B(R)) be measurable spaces with 
Borel cr-algebras. The volatility process 8 = (0 t )t>o is defined by the Ito equation 

de t = b{t,6 t )dt + cr(t,e t )dWt+ [ u{6 t -,x){n e - u e )(dt,dx), (2.2) 

where Wt is a standard Wiener process and — /i e (dt, dx) is a Poisson measure on 
(R+ x R,S(R + )®S(R)) with the compensator v {dt,dx) = K(dx)dt, where K(dx) 
is a a— finite non-negative measure on (R, B(R)). We assume that EQ\ < oo, the 
functions b(t, z),cr(t, z), and u(z,x) are Lipschitz continuous in z uniformly with 
respect to other variables, and 

\b(t,z)\ + \a(t,z)\ 2 + J \u{z,x)\ 2 K(dx) <C(l + \z\ 2 ). 

It is well known that under these assumptions (2.2) possesses a unique strong 
solution adapted to F, and Ed 2 < oo for any t > 0. 
The generator C of the volatility process is given by 

Cf(x) :=b(t,x)f'(x) + ±a 2 (t,x)f"(x) 

+ f (f{x + u{x,y))-f{x)-f\x)u{x,y))K{dy). 

Before proceeding with the assumptions and main results we shall introduce 
additional notation. Set 

m(s,t)= / (r{8 u ) - \v 2 {0u)] du, 
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and 

rt 



a z (s,t) = j v\9 u )du. 

For simplicity, it is assumed that v 2 (s, t) is bounded away from zero. Let us denote 
by Ps,t(y) the density function of the normal distribution with mean m(s,t) and 
the variance a 2 (s,t): 

I (y-m(,s,i)) 2 

PsAv) ■= Af= ; ^ e "'^ ■ (2-3) 

Clearly, p is the conditional density of the stock's log-increments X t — X s given 9. 

Let T e = {Tf)t>o be the right-continuous filtration generated by (6t)t>o and 
augmented by P-zero sets from T . Denote by G k the conditional distribution of 
Tfc + iwith respect to 1 T B V G(k). That is, G k is the distribution of the time of the 
next observation, given previous history, and given 9: 

G{{dt) = P(r fe+1 G dt\T 9 V Q{k)). (2.4) 

Without loss of generality we can and will assume that G 6 k (dt) is the regular version 
of the right hand side of (2.4). 

Let iV = (N t )t>o be the counting process with interarrival times: tq = 0, {r k — 
Tfc_i)fc>i, that is 

N t = Y,I(Tk<t). 

fc>i 

2.3. Assumptions. The following assumptions will be in force throughout the 
paper: 

A.0: For every ^-predictable and a.s. finite stopping time 5, 
P(N S -N S _ ?0\g s -) =0or 1. 

A.l: The Brownian motion B is independent of (9,N). 

A. 2: For every fc, there exists a (fc)-mcasurable integrable random measure 
on B(R+), so that, for almost all uj £ O, $fc([0,Tfc(w)]) = and G e k is absolutely 
continuous with respect to <f>fc . 

Denote by <f)(T k ,t) — cj)(9 7 T k ,t) the Radon-Nikodym derivative of G k (dt) with 
respect to $k(dt), i.e. for almost every u, 

a*fc((Tfc,iJ) 

Assumption A.0 is not essential for the derivation of the filter. However, under 
this assumption the structure of the optimal filter is simpler, and in the practical 
examples important for this paper, this assumption holds anyway. In particular, A.0 
is verified if the conditional distribution G k = P(rfc+i < t\T 6 V Q{k)j is absolutely 
continuous with respect to the Lebcsguc measure 2 or if the arrival times Tfc are 
non-random. 

The following two simple but important examples illustrate the assumption A. 2. 



^Here and below T 1 V stands for the cr-algcbra generated by the a— algebras T 1 and T z . 
2 More generally, it holds if the compensator of the counting process Nt is a continuous process. 
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P(r k+ i <t\j*vg(k)) 



Example 2.1. Let (Tfc)fc>o be the jump times of a doubly stochastic Poisson process 
(Cox process) with the intensity n(8t). In this case, 

- f n(0 s )ds 
1 - e U k y s > ,t>T k 

, otherwise. 

Then, one can take $>k{ds) = ds and (f)(r k ,s) = n(#t)exp( — f* n(6 u )diij. If 
n(6t) = n is a constant, one could also choose 

$> k (ds) = nexp {n{Tk — s)\ds and ^(r^, s) = 1. 

Example 2.2. If the filtering is based on non-random observation times (e.g., 
T k = fcft, where h is a fixed time step) then a natural choice would be 

$k{ds) = 5{ Tk+1 y(s)ds and 0(r fc , s) = 1. 

For practical purposes, ^k(ds) must be known or easily computable as soon as 
the the observations (n, Xi)i<k become available. In contrast, the Radon-Nikodym 
density 4>(rk) is, in general, a function of the volatility process and is subject to 
estimation. 

We note that A. 2 could be weakened slightly by replacing G e k by a regular version 
of the conditional distribution of Tfc + iwith respect to T~. h ± _ V Q(k). The latter 
assumption would make the proof a little bit more involved and we leave it to the 
interested reader. 

3. Main results and introductory examples 

3.1. Main result. For a measurable function / on R with E\f(9 t )\ < oo, define 
the conditional expectation estimator 7r t (/) by 

n(f) := E(f(0 t )\g t ) = [ f(z)* t (dz), (3.1) 

JR 

where nt(dz) := dP(9t < z\Gt) is the filtering distribution. (Note that we omit the 
argument 9 t of / in the estimator ir t (f)). In the spirit of the Bayesian approach, it 
is assumed that the a priori distribution 

TT (dx) = P(6» G dx) 

is given. 

Let u{6 Tk } be the cr-algebra generated by 6 Tk . For t > r k , let us define the 
following structure functions: 

Mf; t, y, e Tk ) ■■= E(f(6 t ) Prk . t (y - x k )4>(T k ,t)\<j{e Tk } v g{k)) , (3.2) 

and its integral with respect to y 

^ k (f;t,e Tk ) := f ii k (f;t,y,6 Tk )dy = E(f(6 t )<f>(T k ,t)\*{e Tk }vg(k)), (3.3) 

where p and (j> are given by (2.3) and (2.5), respectively. 
If / = 1, the argument / in tp and ip is replaced by 1. 
Write 

POO 

$ fc ({r fc+1 }) := / I(t = T k+1 )$ k {dt), 
Jo 

i.e. $ fe ({r fe+ i}) is the jump of ® k (dt) at r k+1 . 
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Finally, for t > r k and a bounded function /, define 

u(f , s ^(^(/^))-7r t „(/)7r Tfc (V; fc (l;t)) 

whenever the denominator is not zero, and M k (f; t, 7r t ) = if the denominator is 
zero. 

The main result of this paper is as follows: 

Theorem 3.1. Assume A.0-A.2. Then for every measurable bounded function 
f in the domain of the generator C such that J Q E\Cf(9 s )\ds < oo for any t > 0, 
the following system of equations holds: 

1) For every k = 0, 1 . . . , 

^ +1 (/) = ^[t fc( n ; !'^ ft , -M k (f;t,n t ) } -<S>({r k+1 }). (3.4) 

2) For every k = 0, 1 . . . and t eflrfc, Tfc + i[[, 

dn t (f) = n t (£f)dt - Mk(f;t,7r t )<t> k (dt). (3.5) 



3.2. Remarks. 

- 1. Equations (3.4), (3.5) form a closed system of equations for the filter 7r t (/). 
It is often convenient and customary (see e.g. [20], [21] and the references therein) 
to write a differential equation for a measure- valued process Ht (dx) in its variational 
form, i.e. as the related system of equations for Ht(f) for all / from a sufficiently 
rich class of test functions belonging to the domain of the operator C. In our 
setting, such a reduction to the variational form is a necessity, since in some cases 
the filtering measure ir s (dx) = P(0 S € dx\G s )msLy not belong to the domain of C. 
However, in the important examples discussed below, there is no need to resort to 
the variational form. The interested reader who is unaccustomed to the variational 
approach might benefit from looking first into the examples at the end of this 
section and in Section 5, where the filtering equations are written as equations for 
posterior distributions. 

- 2. The system (3.4) simplifies considerably if 

M k (f; t, 7r t ) {t ^ +l} • <KK+i}) = 0, for all k. (3.6) 

Obviously, (3.6) holds if for all k, <f> k (dt) is continuous at t = r k+ i as in the case 
when Nt is a Cox process. In fact, (3.6) holds true in many other interesting cases, 
even when & k (dt) has jumps at all Tfc+i, as in the case of fixed observation intervals 
(see Example 5.3 below). We note then that the following separation principle holds. 

Corollary 1. Assume (3.6). Then the filtering at the observation times {r k } k >i 
does not require filtering between them; it is done by the Bayes type recursion: 

T »+ AI) n Tk (M^y))St=r k+1 y 

\y=A k + 1 ) 

- 3. Note that for high-frequency observations, even if condition (3.6) is not met, 
for all practical purposes, it may suffice to compute the volatility estimates only 
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at the observation times. In that case, one would only use the relatively simple 
recursion formula (3.4), and disregard equation (3.5). 

- 4. Clearly, the "structure functions" ip an d ip are of paramount importance 
for computing the posterior distribution of the volatility process. We would like to 
stress that these do not involve the observations and could be pre-computed "off- 
line" using just the a priori distribution. Then, "on-line" , when the observations 
become available, one needs only to plug in the obtained measurements (r k ,X k ), 
and to compute Ttt(f) by recursion. This feature is important for developing efficient 
numerical algorithms. 

- 5. Note also that for almost every u> £ fl, filtering equation (3.5) is a linear 
deterministic equation of Kolmogorov's type, rather than a nonlinear stochastic 
partial differential equation. The latter is typical of the nonlinear filtering of diffu- 
sion processes. The well-posedness and the regularity properties of equation (3.5) 
are well researched in the literature on second order parabolic deterministic integro- 
diffcrential equations (see e.g. [14], [18], [11] and the references therein). 

Example 3.1. ( Volatility as a Markov Chain.) Let us now assume that the counting 
process is a Cox process with intensity n(9 t ), and take 4>(r k , s) = n(9 t )e T <> nl ^ 9u ^ du 
and $fe(c?s) = ds. Also assume 9 = (9t)t<T is a homogeneous Markov jump process 
taking values in the finite alphabet {ai, . . . ,om} with the intensity matrix A = 
|| A(cii, Oj)|| and the initial distribution p q = P(#o = a q)i ? = lj • • • > M. (This is one 
of the two models of the state process discussed in [5].) In this case, 

Cf(9 s )=Y,M0 s ,a 3 )f(a 3 ). 

3 

Denote by 9\ the process 9 t starting from Oj, and 

p 3l (t) := P(9 t = ai \6 = a ), ir 3 {t) = P(9 t = aj \9 t ), 
rji (t,z) :=E(e-S>^^ t {z)\6i= ai ), 

where p 3 t (z) is obtained by substituting 9 3 s for 9 S in p 1 (z). It follows from Theorem 
3.1 (for details see Example 5.1 ), with f(9 t ) := I{g t=ai }, that 

, , n{ai) rji(r k - T k -i,X k - X fe _i)p J - i (r fe - Tk-i)irj(rk-i) .„ 

Ki(T k ) = — r r r -. r . (6.7) 

2^ it j n(ai)rji(Tk - Tk-i,X k - X k -i)p 3l (T k - rfc_i)7rj(Tfc_i) 

This recursion can be easily computed, once one computes off-line the values ry. 
This example is also treated in more detail in Section 5. 



4. Proofs 

In the proof of the main result we want to show that 

dnt(f) = -Kt{Cf)dk + dM t) 

where M t is a martingale, and then we find a (integral) martingale representation 
of Mt with respect to the measure \i — v, where v is a compensator of /i. We first 
find the compensator. 
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4.1. (£/t)-compensator of \x. Denote by V(G) be the predictable er-algebra on 
x [0, oo ) with respect to Q and and set 



V(G)=V(G)( 

A nonncgative random measure u(dt, dy) on V(G) is called a V(G) -compensator 
of /i if for any 'P(C/)-measurable, nonnegative function <p(t,y) = ip(uj,t,y), 

(i) / / (p(s, y)v(ds, dy) is 7 3 (0)-measurablc 
Jo Jr 

/•OO /> />oo /> 

(ii) E cp(t,y)fi(dt,dy)=E hp(t,y)v(dt,dy). 

Jo Jm Jo Jr 

Let G k (ds, dx) = G k (uj, ds, dx) be a regular version of the conditional distribution 
of (rfe + i,Xfc + i) given Q (k) ( it is assumed that G k ([0,T k ],dx) = 0): 

G k (dt,dy) = dP(r k+1 <t,X k+1 <y\G(k)). (4.1) 

Denote G k {ds) = G k (dt,M), that is, G k {t) = P(r k+1 < t \ Q(k)) (with probability 
one). 

By Theorem III. 1.33 [8] (sec also Proposition 3.4.1 in [16]), 

We now derive a representation, suitable for the filtering purposes, of the V(G)- 
compensator v in terms of the structure functions (3.2), (3.3), and the posterior 
distribution of 9 . 

Lemma 4.1. The V '(G)— compensator v admits the following version: 

u(dt,dy)=^Ij Tk , Tk+1 j(t) ^^[fj,, M dt)dy. (4.2) 
iTo h K Tk (ip k (l;s))<£> k (ds) 

Proof. By A.l for t > T k , with probability 1, 
P(r fc +i <t,X k+1 <y\F 6 VG(k)) 

= E(p( Tk+l < t,X k+1 < y\T 9 V Q(k) V a(r k+1 )) \T 9 V Q(k) 
= E (hr k+1 <t)P{Xk+i < y\T e V G (k) V a (Tk +1 ) )\T e V G(k) 
= E (l( Tk+1 <t) £ Pr k , Tk+1 ( z ~ X k )dz\T e V £(fc)) 

[ V p Tk Az-X k )dzG e k (ds), 

where we recall that G k is a regular version of the conditional distribution of 
Tfc + iwith respect to T B V G(k). Thus, by A. 2, for t > r k , with probability 1, 

P(r k+1 <t,X k+1 <y\F e VG(k)) 

p TktS (z - X k )(f>{T k ,s)dz<S> k {ds). (4.3) 
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By (3.2), using notation (3.1), we see that 

E(E[<f>(T k ,s)p Tk>s (z - X k )\a{8 Tk } V G(k)] \Q(k)) = 7^(^(1; s, z)). 
This, together with (4.1), yields, recalling definition (4.1), 

G k (ds,dz) = 7r Tfc (Vfc(l; s, z))$ k (ds)dz. 
In the same way, for t > r kl with probability 1, 



G fc ([t >0 o],M) = J 7^(^(1; «))**(ds). (4.4) 

This completes the proof. □ 

Remark 3. If the right hand of (4.4) is zero, then P(r k+ i > t\Q{k)) = 0. 
Hence, h Tk ,T k+1 \{t) = with probability 1 and, by the 0/0 = convention, the 
corresponding term in (4.2) is zero. 

4.2. Semimartingale representation of the optimal filter. In this section we 
will prove the following result. 

Theorem 4.1. For any bounded function f from the domain of the operator C 
such that gE\Lf(0 s )\ds < oo for all t < oo, the differential of the optimal filter 
7r s (/) is given by equation 

dTTs(f) = TT s (Cf)ds (4.5) 

+ L ( § wj « li s s ; | - (/>) <** - ") ^ ^ ■ 

Proof. It suffices to verify the statement for twice continuously diffcrcntiable func- 
tions / with /, /'/" bounded. By Ito's formula, 



f(0t) = f(e o )+ / Cf(6 3 )ds+ / /'(0>(0 fl )dW s 
Jo Jo 

f{e s -)u{e 8 -,x)(jjL e -v s ){ds,dx). 



Denote 

L t = / f'(9 s )a(e s )dW s + I I f'(9 s -)u(8 s _,x)(p e ~v e )(ds,dx). 



o 



Then, we have 

7r t (/)=£(/(0„)|ft) 



Set 



+ Cf{6 s )ds\g}j +E{Lt\Qt). 

M t = {E(f(e )\g t )-7r (f)} 
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Obviously, the process E[f($o)\Qt) — tto(/) is a C/ t -martingale. Process L t is a 
jF t -martingale. Since Gt Q Ft, for t>t', 

E(E{L t \g t )\g t ,) = E{E{L t \Fv)\9v) = E{L V \Q V ). 

Consequently, E(L t \Qt) is a martingale too. 

Finally, E( J q Cf(6 s )ds\Qt) — f Q ir s ((£f))ds is also a C/t-martingale. Indeed, for 
t > s > t', we have E{v s (Cf)\Q v ) = E(£f{9 s )\g t >) which yields 



E 



eIJ^ Cf{6 s )ds\gA - J n s (£f)ds 



J £f(0s)ds 



I TTs(Cf)ds. 



Thus, M t is a £/ t -martingalc. In particular, this means that 7Tj(/) is a Q - 
semimartingale with paths in the Skorokhod space D)[o ]00 )(R), so that 7r t (/) is a 
right continuous process with limits from the left. By the Martingale Representation 
Theorem ( see e.g. Theorem 1 and Problem l.c in Ch.4, §8. in [16]), 



H (s,y)(fj, - v)(ds,dy). 



It is a standard fact that P(N S - N S - ^ 0\Gs-) = v({S},R+). Hence, due to 
assumption A.O, by Theorem 4.10.1 from [16] (see formulae (10.6) and (10.15)), 

H(t,y) = Ml(AM\V(g))(t,y), 

where AM t = Mt — Mt- and the conditional expectation M^(g|7 :, (tJ)) is defined 
by the following relation (see, e.g. [16], Ch. 2, §2 and Ch. 10, §1): for any 
■p(C?)-mcasurablc bounded and compactly supported function ip(t,y), 

E v(t,y)g t n(dt,dy) = E cp(t,y)Ml(g\V(g))(t,y)v(dt,dy). 

Jo Jm Jo Jr 

By Lemma 4.10.2, [16], 

K(Mf)\ng))(t,y) = MZ(f\V(G))(t,y). 



(4.6) 



BlllCC, 7T t _ (/) is P(£)-measurable (which implies M^(w-(f)\V(g))(t,y) = ir t -(f) ), 
by (4.6), 



Ml(AM\V(g))(t,y) 



{n(f)-n-(f)\P(G))(t,y) 



= Ml(f\V(g))(t,y)-n t -(f). 
To complete the proof one needs to show that 

^r k (^k{f;s,y)) 

']r fc ,T fc+1 ]^, 

fc>0 



M^(/(^.)|^))( S ,y) = E J h^ +1 ]( s ) 



n Tk ('>Pk(l;s ) y)) 



(4.7) 
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To prove (4.7), it suffices to demonstrate that for any 'P(C/)-measurable bounded 
and compactly supported function tp(t, y), 

t^Q J{r k ,r k + 1 ]n(r k ,oo) Jr *r k W>k(l; t, y)) 

= eJ^ J <p(t,y)f(0 t )n(dt,dy). (4.8) 

By monotone class arguments, we can assume that ip(t, x) — v(t)g(x), where v(t) 
is a "P(C7)-measurable process and g(x) is a continuous function on R. By Lemma 
III. 1.39 [8], since v(t) is V(Q)— measurable, it must be of the form 

oo 

«(*) = v + $>k(*)^,7vn](*)> ( 4 - 9 ) 

k>l 

where vo is a constant and v k (t) are Q (k) ® B(M+) -measurable functions. 

Owing to (4.9) and Lemma 4.1, in order to prove (4.8), it suffices to verify the 
equality 



E 



(r fc ,r fc + 1 ]n(r fc ,oo) Jw. *T k {Vk *, V)) 



= E [v k (T k+1 )g(X k+1 )f(8 Tk+1 )l {Tk+1<oo} ] , (4.10) 

The next step follows the ideas of Theorem III. 1.33 [8]. We have 

E [v k (T k+1 )g(X k+1 )f(9 Tk+1 )l {Tk+1<oo} ] 
= E [E(v k (r k+1 )g(X k+1 )f(9 Tk+1 )l {Tk+1<oo} \G(k) V T 6 )] 

= E ( f f v k ( S )g(y)E [f(6 s )G e k (ds, dy) \Q (k)} j , 



where, as before, G e k (ds, dy) is a regular version of the conditional distribution of 
(Tfc+i, Xk+i) with respect to T e V Q(k). 

By Fubini Theorem, and recalling notation (4.1), 



E\ / v k (s)g(y)E [f (9 s )Gi(ds , dy)\G (k)} 

\J(T k ,oo)Jm J 

w ( f f , ^ ( , E[m)Gj(ds,dy)\g(k)] 

=^fr + 7^(^)- [/( ' s)Gf(dM2/)l " (fc)n 



T k 



By (4.1), 



G fc ([*,oo];R) )' 
G k (ds,dy) = p Tk ,s(z - X k )(j)(T k , s)$ k (ds)dy. 
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Hence, for s > r k , 

E[f(e s )G 9 k (ds,dy)\g(k] 

= E(E {f(6 s )p TktS (y - X k )4>{T k , s)\a{e Tk ) V Q{k)) \Q{k)) <Z> k (ds)dy 
= K Tk (*fJk(f;s,y))dy§ k (ds). 
This, together with (4.4), yields 

B U J- G k {[8,00m ) 

so that (4.10) is satisfied, and the proof follows. □ 

4.3. Proof of Theorem 3.1. In this section we show that Theorem 3.1 follows 
from Lemma 4.1 and Theorem 4.1. 

Proof. Firstly, we note that the stochastic integral in the right hand side of (4.5) 
can be written as the difference of the integrals with respect to fi and v. Indeed, 
since / is bounded, this follows from [8], Proposition II. 1.28. 

By applying Lemma 4.1 and integrating over y one gets that for t E]Tfc, Tfc+1], 

C(T h ,t] V7r r fc (V'fe(l;S,y)) / 

K Tk (4>k{f;s)) -n s -(f)n Tk (^(^,s)) ^ 
{ruA 7r Tk (ip k (l; u))^ k (du) 

This equation verifies that (3.5) follows from the semimartingale representation 
(4.5), for t between the consecutive observation times. 
For the jump part (3.4), we note that 

/ / n s -(f)fj,(ds,dy) = ^ ^(rk+O-Cf) 

JO JR ^ <:* 

and 

r n Tk (ip k (f;s,y)) \- K Tk (ipk(f;s,y)) 



Jo Jr x Th {i>k{i-;s,y)) 



Tk + l<t \y=X k+1 l 

Now, (4.5) can be rewritten as follows: 



7Tt(/)=7To(/)+ / n s (Cf)ds 

Jo 

Ttr k (?l>ktf;s,yj) 




7rr fe (^fe(l;s,y))( s =^+i) 
Mk(f;s,TT s )$ k {ds). 



*W)-Cf) (4.11) 



fc>0 



(T fc ,tAr fc+ i] 
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Suppose t e]7fc,Tfc+i[. Then, 



It follows that 

1"(r fe+1 )-(/) 



+ / ir s (£f)ds- / X fe (/; S ,^ s )$ fc (cis). 



7T S (Cf)ds 



( T fc+l)- 



•M fe (/;s,7r s )$ fc (ds). 



Therefore, from (4.11), 

m = KT k &k(f;s,y)) 

iv—^k+i) 

This completes the proof. 



- M k (f; t, 7r t ) {t=Tfc+l} $({r fc+1 }). 



□ 



5. Examples 

In this Section we consider some important special cases of Theorem 3.1. 

Example 5.1. (Markov chain volatility and Cox process arrivals.) Recall the setting 
of Example 3.1 and its notation nj, ^j(t), and 9 3 . It follows from Example 2.1 that 
in this case § k ({r k +\}) = for all fc's. Hence the second term in the right hand 
side of equation (3.4) is zero. By (3.2), for f(0 t ) = l{e t = ai } and t > T k , 



M;t,V,8-n.) = n{ai) [E(l {6t=ai} e- SX e ^ du p S)t (y - x)\9 a ) 
Thus, owing to the homogeneity of t , for t > 7>, 

= Y / n(a i )E(l { g t=ai} e-^ n ^ du p Sit (y-x)\e s = a j 



{x=X k } 



3 
3 

Yn(aj)rji(t -r k ,y- X k )pji(t - T k )irj(T k ) 



{x=X k } 



\x=X k S 

7Tj(r fe ) 



s=r k \Kj(Tk) 



r s=r k i ' 

\x=X k J 



Similar formula holds for the denominator of the first term of the right hand side 
of the equation. Now equation (3.7) follows from (3.4). 

Repeating the previous calculations and using the notation 

r jt (t) :=E(e-fo^ du \ei = ai ), 
it is readily checked that, for t > Tfc, 
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and 

ir Tk (V>fc(l, t)) = ^2 Kj(Tk)n(ai)fji(t - T k )pji(t - r fc ) 

i,j 

which are needed in computing (3.5). It is easily verified that in the setting of this 
example, equation (3.5) reduces to the following: 

dir^t) = ^\(a j ,ai)n j (t)dt + D(T k ,t)n i (t)dt + D i (Tk,t)dt, (5.1) 

3 

where 

. . n(ai) Ej rji(t - Tk)Pji(t - Tfc)7Tj (r fc ) 

D t {T k ,t) = - 



D(r k ,t) 



JT Zrij n(a,i)r Tl (s - T k )pji{s - T k )irj{r k )ds 
Eij n{ai)fji{t - T k )p 3 i{t - T k )-Kj{T k ) 



It n(a,i)rji(s - T k )pji(s - T k )%j(T k )ds 

Note that equation (5.1) is considered for a fixed co and t > T k (ui). Therefore, r k 
and tt. (rfc) should be viewed as known quantities. 

Example 5.2. (Poisson arrivals.) Suppose that the interarrival times between the 
observations are exponential with constant intensity n(9) = A. In other words, Nt 
is Poisson process with constant parameter A. In this case, the volatility process 9 
is independent of N t . Then, on the interval r k < t < r k+ \, equation (5.1) reduces 
to 

d-Ki{t) = X(aj,ai)TTj(t)dt 

3 

- ~ T fc)^(^) - n(t))dt. (5.2) 

3 

On the other hand, owing to the independence of N and 9, it is readily checked 
that on the interval r k < t < r k +i, 



Therefore, the filtering equation (5.2) is simply the forward Kolmogorov equation 
for 9. 

A similar effect appears also in the following example. 

Example 5.3. (Fixed observation intervals.) Assume for simplicity that the Markov 
process 9t is homogeneous. Also assume that r k = kh, where h is a fixed time step. 
Notice that 

Qt = Q{k) for any t e [[r fc ,r fc+ i[[. 
Denote by P(t, x, dy) the transition probability kernel of the process 9 t , given that 
9 = x, and let T t denote the associated transition operator. 

In accordance with Example 2.2, one can take cf>(T k ,t) = 1 and <f> k (dt) = 
S[ Tk+1 y(t)dt. Thus, we get 

Mf;t, y, 0r k ) = e [f(9 t )p Tk , t (y - x k )\a{9 Tk ) v g{k)\ , 

i> k (f-t,6 Tk )=T t - T J(9 Tk ):= ( f(y)P(t-T k ,9 Tk ,dy). 
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Since & k (dt) = on Jrfe, Tfc+i [[, (3.5) is reduced to the forward Kolmogorov equation 

subject to the initial condition 7r Tfc (/). The unique solution of this equation is given 
by 7it(/) = 7r Tfe (Tt_ Tfe /), t < r k+1 . Hence, 

Since (j}(r k ,t) = 1, the denominator of ./Vf*, is equal to 1 when £ = Tfc+i. This 
together with the formula ^({r k+ i}) = 1 yields 

M k (f;t,n t ) t=Tk+ M{r k+ i}) = n Tk (T h f) - 7r Tfe+1 _(/). (5.3) 

Owing to (5.3), we get M k (f; t, n t ) t=rk+1 <f>({T k+1 }) = 0. 
This yields the following recursion formula: 

m = KT k {$k{f;t,y)) 
Tk+lU> 7j Tk (Ml;t,y)) 

V= X ^k+l 

_ J R E (f(°t-T k )pa.t-T k (y- z )\°o = z)n Tk {dz) 

S R E(po,t-T h (y-z)\Oo = z)ir Tk (dz) t=r k+i ' 
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